Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
1173 | 8114 | 1,5 |
3435 | 2397 | 2,5 |
5143 | 1467 | 3,5 |
5320 | 1405 | 1,2 |
5501 | 1352 | 1,3 |
6300 | 1144 | 1,4 |
6848 | 1038 | 1,7 |
7145 | 981 | 1,6 |
7307 | 954 | 1,1 |
7316 | 953 | 0,5 |

Words containing ):

Rank in Wordlist | Frequency | Word |
---|---|---|
177781 | 9 | .) |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
2395 | 3688 | 100% |
4132 | 1913 | 10% |
4232 | 1870 | 50% |
4752 | 1614 | 20% |
5269 | 1422 | 90% |
5293 | 1415 | 80% |
5617 | 1317 | 30% |
6135 | 1185 | 5% |
7817 | 883 | 40% |
7822 | 882 | 60% |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
2245 | 3930 | CD&V |
5230 | 1435 | S&P |
11496 | 538 | S&P500 |
17633 | 303 | H&M |
23637 | 202 | CD&V-voorzitter |
26716 | 171 | B&W |
26949 | 169 | B&B |
29249 | 151 | I&O |
30067 | 145 | C&A |
30884 | 140 | R&D |

Words containing $:

Rank in Wordlist | Frequency | Word |
---|---|---|
91174 | 27 | A$AP |
106541 | 21 | A$AP Rocky |
426858 | 2 | 75-$85 |
427807 | 2 | A$M |
437910 | 2 | Ca$h |
458172 | 2 | Jen$en |
490405 | 2 | U$A |
490406 | 2 | U$D |
586761 | 2 | ‘$100 |
595096 | 1 | 0,79-$0,81 |

Words containing ":

Rank in Wordlist | Frequency | Word |
---|---|---|
351 | 26962 | ." |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
360 | 26459 | zo'n |
845 | 11535 | z'n |
1722 | 5330 | .' |
2019 | 4398 | auto's |
2942 | 2908 | m'n |
3183 | 2637 | foto's |
3401 | 2433 | Zo'n |
4038 | 1972 | collega's |
5594 | 1327 | homo's |
5861 | 1254 | risico's |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
21179 | 235 | Apple TV+ |
44075 | 83 | Maastricht UMC+ |
64185 | 47 | 2+2 |
80438 | 33 | 1+1 |
103553 | 22 | 3+1 |
106526 | 21 | 80+ers |
107405 | 21 | P+R |
109828 | 20 | 70+ers |
126973 | 16 | K+S |
131038 | 15 | 65+ers |

Words containing *:

Rank in Wordlist | Frequency | Word |
---|---|---|
62706 | 49 | Sportkoepel NOC*NSF |
114707 | 19 | Sagittarius A* |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
2014 | 4404 | en/of |
3370 | 2461 | km/u |
7665 | 902 | https://www |
7977 | 864 | t/m |
8875 | 753 | hij/zij |
8946 | 743 | 24/7 |
9454 | 690 | zijn/haar |
14992 | 378 | k/w |
15053 | 376 | km/h |
16416 | 335 | km/uur |

In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words; others will not. If words containing forbidden characters do not have a very low frequency, there might be a problem in preprocessing.
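
The check described here can be sketched in a few lines. This is only an illustration, assuming the word list is available as (word, frequency) pairs; the function name `suspicious_words`, the `allowed` character set, and the threshold `min_freq` are hypothetical and not part of the original tooling.

```python
# Special characters examined in this subsection.
SPECIAL_CHARS = set(",()%&$\"'+*=/_")


def suspicious_words(entries, allowed, min_freq=10):
    """Return (word, freq, chars) for words that contain a forbidden
    special character and occur at least min_freq times."""
    forbidden = SPECIAL_CHARS - set(allowed)
    hits = []
    for word, freq in entries:
        found = sorted(forbidden.intersection(word))
        if found and freq >= min_freq:
            hits.append((word, freq, "".join(found)))
    return hits


# Example rows taken from the tables in this section.
sample = [("zo'n", 26459), ("100%", 3688), ("CD&V", 3930), (".)", 9), ("U$D", 2)]

# Assumption for this example: apostrophe and ampersand are legitimate
# inside words of this language; everything else is treated as forbidden.
print(suspicious_words(sample, allowed="'&"))
```

Which characters count as allowed, and what counts as a "very low" frequency, depends on the language and corpus size; both values above are placeholders.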
Example query for words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
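
The same query can be repeated for each special character, but `%` and `_` are themselves LIKE wildcards and must be escaped. The following is a minimal sketch using Python's `sqlite3` module as a stand-in for the actual database (the query above appears to be written for MySQL, where the backslash is the default escape character). The helper names `like_pattern` and `top_words_containing` are hypothetical, and the `ORDER BY w_id` clause is an added assumption that makes the row order explicit.

```python
import sqlite3


def like_pattern(char):
    """Build a LIKE pattern matching words that contain `char` literally."""
    # % and _ are LIKE wildcards and the backslash is the escape character,
    # so these three characters need a preceding backslash.
    escaped = "\\" + char if char in "%_\\" else char
    return "%" + escaped + "%"


def top_words_containing(conn, char, limit=10):
    """Mirror the query above: rank is w_id - 100, ids up to 100 are skipped."""
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (like_pattern(char), limit)).fetchall()


# Tiny in-memory table with three rows taken from the tables in this section
# (w_id = rank + 100); the schema is an assumption based on the query.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)")
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [(460, "zo'n", 26459), (2345, "CD&V", 3930), (2495, "100%", 3688)],
)

print(top_words_containing(conn, "%"))  # only the word containing a literal %
print(top_words_containing(conn, "_"))  # no rows; unescaped, _ would match every word
```

Passing the pattern as a bound parameter also avoids quoting problems for the characters `"` and `'`.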